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S Abstract 

We present three new sets of next-to-leading order parton distribution functions (PDFs) deter- 
O ■ mined by global fits to a wide variety of data for hard scattering processes. The analysis includes 

target mass and higher twist corrections needed for the description of deep inelastic scattering data 
at large x and low Q 2 , and nuclear corrections for deuterium targets. The PDF sets correspond 
to three different models for the nuclear effects, and provide a more realistic uncertainty range for 

a? 

the d quark PDF, in particular, compared with previous fits. We describe the PDF error sets for 
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each choice of the nuclear corrections, and provide a user interface for utilizing the distributions. 
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I. INTRODUCTION 



Parton distribution functions (PDFs) are necessary ingredients for any calculation of 
a hard scattering process involving hadrons in the initial state. Deeply-inelastic lepton- 
hadron scattering (DIS) experiments provide data which yield information on PDFs over a 
wide range of parton light-cone momentum fraction x and four-momentum transfer squared 
Q 2 . Traditional global fits focus on the extraction of the leading twist PDFs, utilizing cuts 
on Q 2 and the hadronic final state mass squared W 2 = M 2 + Q 2 (l — x)/x, where M is the 
nucleon mass. These cuts, which typically are of the order Q 2 > 4 GeV 2 and W 2 > 14 GeV 2 , 
are designed to eliminate regions where effects that do not scale with Q 2 may be important. 
At any finite value of Q 2 , the available range in x is therefore kinematically limited by 
x < x max = Q 2 /{W 2 — M 2 + Q 2 ). Available DIS data sets have few points above x w 0.7 
when these types of cuts are applied. Other types of data such as lepton pair production 
or high-pr jet production are also generally limited to regions which are sensitive to PDF 
momentum fraction values x < 0.7, or have limited statistics. There are few constraints, 
therefore, on the large- x behavior of PDFs, and the results of global fits using these cuts 
should be considered as extrapolations when evaluated above this value of x. 

Aside from the intrinsic interest in the behavior of PDFs at large values of x l|-[5) , there 
are also important practical aspects to consider. The production of a state of mass m 
at rapidity y in proton-proton collisions at a center-of-mass energy ^/s requires knowledge 
of PDFs at x ~ (m/i/i)e ±s/ at a scale of order m. High mass states produced in the 
forward scattering region at large positive values of y therefore entail the product of one 
PDF evaluated at a large value of x with another at a small x. Furthermore, PDFs at 
large x and small factorization scale Q evolve via the QCD evolution equations to lower x 
and higher Q, so that large- a; uncertainties in fixed-target experiments can have significant 



consequences for collider measurements 



4121- 

In an effort to obtain constraints on PDFs at large values of x, a study was made by 
Accardi et al. in which the Q 2 and W 2 cuts applied to DIS data were systematically 
relaxed, thereby allowing data at higher x and lower Q 2 to be used. This necessitated the 
inclusion of various corrections subleading in 1/Q 2 , such as target mass corrections and 
higher twist contributions, which become increasingly important as Q 2 is lowered and x 
tends to 1. 
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A goal of global PDF fits is to determine the x and Q 2 dependence of the different parton 
species (gluons, individual quark and antiquark flavors). Charged-lepton DIS experiments 
on proton targets are sensitive at large x to the combination 4m + d, but to separate the 
u and d PDFs an additional constraint is needed. Data on a free neutron target at large 
values of x would constrain 4d + u; however, the closest one can come to this is to utilize a 
deuterium target. Since the proton and neutron in deuterium are not free, it is necessary to 
account for nuclear effects such as Fermi motion, binding, and nucleon off-shell corrections, 
which can be significant at large values of x. Estimates of the uncertainties due to different 
models of these nuclear corrections were studied in Ref. {9}. 

The purpose of the present paper is to synthesize the lessons of the previous studies 
8|, |9j, and present three sets of global next-to- leading order (NLO) PDF fits, each of which 
differs by the size of the nuclear corrections employed for the deuteron target DIS data. 
The results are referred to collectively as the CJ12 PDFs (with "CJ" denoting the CTEQ- 
Jefferson Lab collaboration lfj|), and the three PDF sets with mild, medium and strong 
nuclear corrections are designated as "CJ12min", "CJ12mid" and "CJ12max", respectively. 
These PDFs provide a convenient way to explore the effects of the nuclear corrections on 
various observables. 

In the following section we describe the methodology involved in the fits, including the 
choices of data sets, the parametrizations used, and the treatment of nuclear and finite-Q 2 
corrections. We also discuss the treatment of the experimental errors and the resulting error 
PDF sets. In Sec. II III we present an overview of the CJ12 PDF results, and compare these 
with results from other global fits. Finally, in Sec. II VI we make some concluding remarks and 
outline plans for future work. An appendix is provided which contains the initial parameter 
values for each of the PDF sets. 



II. FORMALISM AND PROCEDURES 

The CJ12 PDF sets described herein result from global fits to a wide variety of data 
totalling 3958 data points. The global fitting procedures utilize NLO calculations for each 
observable. For the DIS observables, higher twist contributions and target mass corrections 
are included, and for the deuterium DIS data sets nuclear corrections are taken into account, 
as described in this section. 
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A. Data 



The data sets used, listed in Table fl] encompass data ranging from medium energy 
electron-hadron scattering at Jefferson Lab to high-energy hadron-hadron scattering at the 
Fermilab Tevatron. The kinematic cuts employed in this analysis, Q 2 > Qq = (1.3 GeV) 2 



and W 2 > 3 GeV 2 , allow the inc 
and deuterium from BCDMS 



usion of: Fo structure function measurements on 



rydrogen 



Uj, SLAC [14j, NMC (12,1131, and Jefferson Lab [15|; neutral 



current (NC) and charged current (CC) DIS cross sections from HERA 



16 



from Fermilab 



171 ]; W (and decay lepton) asymmetries from CDF and D0 



; Drell-Yan data 



18 



J22I]; Z rapid- 



ity distributions from CDF and D0 23j, |24j; jet production cross sections from both CDF 
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26], and D0[27J, |28|, and photon plus jet production from D0[29|. 



DIS data with a deuterium target are necessary for u and d quark flavor separation at 
large x, which entails the use of nuclear corrections, as will be discussed and evaluated 
below. Unlike global fits from other groups, the present data set does not include the 
NuTeV dimuon data 3(| from neutrino-iron scattering, which provide essentially the only 
currently available source of information on the s and s quark distributions (except possibly 
for W and Z production at the LHC [3l|). The reason is that, beside corrections due to 
modifications of medium- £ quark PDFs in the heavy nuclear target, which are generally not 



too large 



32h36|. the uncertainty due to in- medium propagation of the tagged final state 



charm quark and D meson may be significant 37j. Similarly, we do not use E605 data on 
dimuon production from a copper target 38J, and rely on E866 data 17J] only to constrain 
antiquark distributions. 

The x and Q 2 coverage of these data is illustrated in Fig. [Tj where leading order kinematics 
was used according to the procedure detailed in Ref. 39]. While nominally there is a 



substantial amount of data at large x from vector boson production measurements, only 
the directly reconstructed charge asymmetry data from the CDF collaboration has 
a sufficient statistical precision to constrain the d quark distribution uncertainty to the 
level aimed for in this study. Note that the lepton charge asymmetry data also have high 
precision; however, the lepton decay smears the primary W production process, resulting 
in an effective large- X reach smaller than that for the W asymmetry. The remaining source 
of large- a; data is DIS, for which the kinematic cuts used in this work are comparable to 
those in the global fits of Alekhin et al. 



40 



411 ]. but lower than the typical cuts used in all 
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TABLE I: Data sets and number of data points used in the global fits, together with the \ 2 values 
for each of the CJ12min, CJ12mid and CJ12max fits. 













X 2 






Experiment 


Ref. # points 


CJ12min 


CJ12mid 


CJ12max 


DIS F 2 


BCDMS (p) 




351 


434 


436 


437 




BCDMS (d) 


mi 


254 


294 


297 


302 




NMC (p) 


[12] 


275 


434 


432 


430 




NMC (d/p) 


[13] 


189 


179 


177 


182 




SLAC (p) 


[14] 


565 


456 


455 


456 




SLAC (d) 


[14] 


582 


394 


388 


396 




JLab (p) 


[15] 


136 


170 


169 


170 




JLab (d) 


[15] 


136 


124 


125 


126 


DIS a 


HERA (NC e~) 


yj 


145 


117 


117 


118 




HERA (NC e+) 


[16] 


384 


595 


596 


596 




HERA (CC e - ) 


[16] 


34 


19 


19 


19 




HERA (CC e+) 


[16] 


34 


32 


32 


32 


Drell-Yan 


E866 (p) 


[17] 


184 


220 


221 


221 




E866 (d) 


[17] 


191 


297 


307 


306 


W asymmetry 


CDF 1998 (£) 


[18] 


11 


14 


16 


18 




CDF 2005 (£) 


[19] 


11 


11 


11 


10 




D02OO8 (£) 


[20] 


10 


4 


1 


4 




D02OO8 (e) 


[21] 


12 


40 


36 


34 




CDF 2009 (W) 


[22] 


13 


20 


25 


41 


Z rapidity 


CDF (Z) 


[23] 


28 


29 


27 


27 




D0(Z) 


[24J 


28 


16 


16 


16 


jet 


CDF run 1 


[25] 


33 


52 


52 


52 




CDF run 2 


[26] 


72 


14 


14 


14 




DOrun 1 


[27] 


90 


21 


20 


19 




DOrun 2 


[28] 


90 


19 


19 


20 


7+jet 


D01 


[29] 


16 


6 


6 


6 




D02 


[29] 


16 


13 


13 


12 




D03 


[29] 


12 


17 


17 


17 




D04 


[29] 


12 


17 


16 


17 


TOTAL 




3958 


4059 


4055 


4096 


TOTAL + norm 






4075 


4074 


4117 



other other global PDF analyses 42|-|46|. The improvement in the large-x kinematic reach 
is significant, amounting to about 1300 more data points roughly equally divided between 
proton and deuteron targets, with adequate precision. This represents some ~ 50% increase 
in the total number of DIS data points compared with the standard cuts. 
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FIG. 1: Nominal coverage of the data sets used in the CJ12 global fits. Top row : vector boson and 
jet production. In the vector boson data plot, My is the mass of the boson, and the weak boson 
data have been offset vertically for clarity. For the E866 dimuon data, the 3-pointed star and open 
triangle symbols correspond to the target and projectile x values, respectively. In the jet data plot, 
Pt is the transverse momentum of the jet. Bottom row : DIS data on a logarithmic and linear x 
scale. In the latter, the W 2 > 14 GeV 2 cut typically used in global PDF fits and the W 2 > 3 GeV 2 
cut used in this analysis are indicated by the dashed and dot-dashed lines, respectively. 

B. PDF parametrization 

For the parametrization of the PDFs at the input scale Qq, a common form has been 
adopted for all parton species /, 

xf(x, Ql) = a x ai (l - x) a ' 2 (l + a 3 y/x + a 4 x). (1) 
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This form applies to the valence distributions xq v = x(q — q), for q = u and d, the isoscalar 
and isovector sea quark distributions x(u + d) and x(d — u), and the gluon distribution xg. 
However, to allow for a more flexible parametrization of the valence d v PDF in the large-x 
region, we add in a small admixture of the u v PDF, 

dv^aft (^ + bx c u v y (2) 

with b and c as two additional parameters. The result of this modification is that d v /u v — > 
<1q v b as x — > 1, provided > a^ v , which is usually the case. A finite, nonzero value of 
this ratio is indeed expected in several nonperturbative models of hadron structure ^\^. It 
is also required from a purely practical point of view, as it avoids potentially large biases 
on the c?-quark PDF central value jsj], as well as on its PDF error estimate, as we discuss 
in detail in Sec. IHIl The ao parameters for the u v and d v distributions are fixed by the 
appropriate valence quark number sum rules, while a 9 Q is fixed by the momentum sum rule. 
Finally, since we do not use neutrino-nucleus DIS data in the fits, the strange quark PDF 
is assumed to be proportional to the isoscalar light quark sea, and parametrized as 



The possibi 



models 



47 



(s + s) = K(u + d). (3) 

ity of asymmetric s and s distributions, predicted in several nonperturbative 
481 ]. can be investigated using dimuon data in neutrino and antineutrino DIS 



301 ] . and will be the subject of future analysis including nuclear corrections for heavy nuclei. 
The parameter values for each of the three PDF sets (CJ12min, CJ12mid and CJ12max) 
are given in Table [TT] in Appendix [A] 

We find that the parametrization ([1]) is sufficiently flexible to allow good fits to the 
combined data sets, with typical x 2 values on the order of one per degree of freedom, as 
shown in Table [H In some instances the parameters were found to have very large errors, 
indicating that the chosen data sets were unable to provide sufficiently strong constraints in 
those cases. Such parameters were fixed at suitable values, and included in the fits at the 
values shown in Table HT1 without errors. 



C. Target mass and higher twist corrections 



The standard method to calculate target mass corrections (TMCs) in DIS is the one based 
on the operator product expansion in QCD, first formulated by Georgi and Politzer 49]. 
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An alternative method based on co 
by Ellis, Furmanski and Petronzio 
the recent reviews of TMCs in Refs 



linear factorization in momentum space was developed 



5 Of. and extended by various authors [5ll-l53| (see also 



Ho4 



55]). All of these methods suffer to some extent 



from the threshold problem, whereby the target mass corrected structure function remains 



nonzero as x — > 1 



56 



581 ] . For the purposes of global fits, however, the region where the 



threshold effects become problematic is W < 2 GeV |58|. which is mostly outside of where 
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60j , the appearance of the threshold 



even the most liberal cuts in W and Q 2 are made 

According to the classic works of De Rujula et al. 
problem is attributed to the neglect of dynamical higher twist (HT) corrections, which scale 
as powers in 1/Q 2 . Moreover, higher order perturbative QCD corrections can also resemble 
power suppressed contributions at low Q 2 . Regardless of their origin, we can account for the 
various power suppressed corrections that are not included in a leading twist calculation by 
parametrizing them using a phenomenological multiplicative factor, as in the earlier C J fits 



F 2 (x,Q 2 



tLT, 



x,Q 2 



1 + 



C{x) 



(4) 



where denotes the leading twist structure function including TMCs. For simplicity we 
generically refer to the fitted 1/Q 2 term as a "higher twist" correction, and parametrize the 
higher twist coefficient function by C(x) = a HT x 6HT (l + c HT x), assuming it to be isospin 
independent (see, however, Refs. [6ll464|). 

With the inclusion of TMCs and a phenomenological HT correction, it was found in 
Ref. js] that the leading twist PDFs are essentially independent of the TMC prescription 
adopted; the HT parameters were able to compensate for the variations due to the different 
TMC algorithms. While the focus in the present work is on the leading twist PDFs, the 
interpretation and isospin dependence of the higher twist corrections are of intrinsic interest 
in their own right, and will be the subject of a future dedicated analysis. 



D. Nuclear corrections 



Since nucleons bound in a nucleus are not free, the nuclear structure function deviates 
from a simple sum of free proton and neutron structure functions, especially at large x where 
the effects of Fermi motion, nuclear binding, and nucleon off-shellness are most prominent. In 



S 



the nuclear impulse approximation the structure function of the deuteron d can be expressed 
as a convolution of the bound nucleon structure function and a momentum distribution f N / d 



of nucleons in the deuteron 



65 



67|, 



F 2 d (x,Q 2 ) = Yl fdzf N/d (z n )F 2 N (x/z,Q 2 ) + 5^F 2 d (x,Q 2 ), (5) 

N=p,n "* 

where the additive term 5^ oS ^F d (x, Q 2 ) represents nucleon off-shell and relativistic correc- 
tions that cannot be expressed through a one-dimensional convolution. The momentum 
distribution, or "smearing function", f^/d is computed in the weak binding approximation 
(WBA), which is appropriate for a weakly bound nucleus such as the deuteron, in terms 



of 



67, 



;he deuteron wave function, and implements nuclear binding and Fermi motion effects 
68j . At Q 2 — > oo it has a simple probabilistic interpretation in terms of the light-cone 



momentum fraction z = (M d /M)(p ■ q/p d ■ q) ~ (M d /M)(p + /pjr) of the deuteron carried by 
the struck nucleon, where p and p d are the four-momenta of the nucleon and deuteron, 
respectively, and M d is the deuteron mass. At finite Q 2 , however, the smearing function de- 
pends in addition on the parameter / y 2 = 1 + Ax 2 M 2 /Q 2 , which characterizes the deviation 
from the Bjorken limit. 

The nuclear corrections at large x depend partly on the strength of the high-momentum 
tail of the deuteron wave function, and we use several wave functions based on different 
nucleon-nucleon potentials to study the deuteron model dependence. We choose the high- 
precision AV18 |69J, CD-Bonn |70J and the relativistic WJC-1 [71] wave functions, which 
provide a representative spread of behaviors at high momentum. Note that the effects of 
the nuclear smearing corrections are not suppressed at large Q 2 , and must be considered at 
all scales wherever data at x > 0.5 are used 8]. 

The off-shell nucleon correction 5^°^F 2 is somewhat more model dependent, but several 



quark model based estimates of this have been made in the literature 



work we follow the approach adopted in our previous analysis js], which utilized the "mod- 
ified Kulagin-Petti" model. In this model, the corrections were related to the change in the 
nucleon's confinement radius in the nuclear medium, as well as the average virtuality of the 
bound nucleons, and constrained to give no net change in the structure function normal- 
ization. In contrast to Ref. [9J], however, here we further take into account the correlation 
between the nucleon "swelling" and the deuteron wave function. The combined effects in- 
troduce a theoretical uncertainty in the extracted PDFs, particularly for the d quark. A 
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72^74 



74(. In this 



detailed evaluation of the impact of this uncertainty will be presented in Sec. IIII1 



E. PDF Errors 



The global fits were performed using a standard \ 2 minimization algorithm due to Mar- 
quardt [751 ]. as described in Ref. 76]. This prescription produces a well-defined Hessian 
matrix typically in fewer than a dozen iterations and is both efficient and well-behaved. 
Where available, correlated systematic errors have been utilized. Error PDF sets were pro- 
duced using the resulting Hessian following the methods described in Ref. j^. Two error 
PDFs were produced for each of the eigenvalues of the Hessian, corresponding to positive 
and negative parameter increments, resulting in a total of 38 error PDFs for each of the 
three choices of nuclear corrections. 

Using these error PDFs it is possible to construct an error estimate 5a for some physical 
observable a via 

T 



5a 



^2 [ a ( a 2i~i) - °( a 2i 



(6) 



where a, denotes the set of parameters corresponding to the PDF set i, and T is a \ 2 
tolerance that will be described below. Alternatively, one can define asymmetric upper and 
lower errors 5a + and 5a_ based on the same error PDF sets using the expressions 



5a * 



T. 



19 



max 



cr[a 2 i-i) - a(a),a{a 2i 



a(a 



5a_ = T A 



10 



max 



a(a) - a(a 2 i-i),a(a) - a(a 2i ) ) 



(7a) 



(7b) 



where a(a) denotes the observable calculated with the "central value" of the fitted PDF set. 

The issue of the appropriate value for the x 2 tolerance T has been discussed at length in 
the literature. The value T = 1 is suitable in the case where all data sets are compatible 
with each other and all errors are Gaussian. In practice, various tolerance values are used 
in different PDF analyses, including T = 10 for the CTEQ6 PDFs 77J, T = \/50 in the 



MRST2002 fits [78j, and dynamically determined values in more recent fits 42|, |43[. In the 



CJ12 analysis discussed in this paper the error bands have been calculated using T = 10. 
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FIG. 2: Ratio of deuteron to isoscalar nucleoli structure functions F^/F^ for the three sets of nu- 
clear corrections considered in this analysis, corresponding to the CJ12min (dot-dashed), CJ12mid 
(solid), and CJ12max (dashed) PDFs. 

F. Theoretical errors 



A significant source of theoretical errors is the modeling of nuclear corrections for DIS 
on deuterium targets, which we refer to as "nuclear uncertainty" in short. Of course, since 
the deuteron can never be considered as composed of free nucleons, it is never physically 
reasonable to assume no nuclear corrections at all. We evaluate the nuclear uncertainty 
by considering nuclear corrections ranging from mild, corresponding to the hardest of the 
modern deuteron wave functions (WJC-1 [71]) coupled to a 0.3% nucleon swelling effect, 
to strong, corresponding to the softest of the wave functions (CD-Bonn [3]) and a large, 
2.1% nucleon swelling. An intermediate nuclear correction corresponding to the AV18 69 ] 
deuteron wave function and a 1.2% nucleon swelling effect is taken for our central fit. These 
corrections approximately span the same range as in our previous analysis 9]. 

The effect of the different nuclear corrections is illustrated in Fig. [2] for the ratio of 
F2 structure functions of the deuteron and an idealized isoscalar target consisting of an 
unbound proton and neutron. The upper, middle, and lower curves are the results for the 
mild (CJ12min), intermediate (CJ12mid), and strong (CJ12max) nuclear corrections. An 
intuitive way to understand the effects of these nuclear corrections on global fits is to consider 
the idealized free proton plus neutron target as being obtained by dividing the deuterium 
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target data by the ratio shown in Fig. [2j In the intermediate- £ region the idealized nucleon 
F2 structure function is larger than the original data, and the increase is largest for the 
maximum nuclear correction model. On the other hand, as one extends to values of x above 
about 0.75, the reverse is true: the results there lie below the data and are lowest for 
the mild set of corrections. 

In a global fit, it is the combination of PDFs and nuclear corrections which is constrained 
by deuterium data, not the PDFs alone. At large x, the u quark is well constrained by free 
proton DIS and other data, so that when fitting the corrected deuterium data (essentially 
the idealized FJ? oc u + d) the d quark will be the distribution most sensitive to the nu- 
clear corrections. Therefore, in the large- x region the d PDF will be smallest for the mild 
correction and largest for the strong corrections (see Fig. [7] in Sec. Ill Ij) . 

One way to evaluate the impact of the nuclear uncertainty on a given observable is by 
calculating it using the CJ12min fit corresponding to the smallest nuclear correction, then 
with the CJ12max fit corresponding to the largest correction. The values of the observable 
spanning these 2 extremes can be considered a fair representation of the theoretical nuclear 
uncertainty. PDF errors can then be added to the central values so calculated, and displayed 
as an outer error band, as done in Ref. js, jj. Alternatively, one could first calculate a PDF 
error band using the CJ12mid set, and display this on top of the 2 bands obtained with 
the min and max sets. The size of the resulting sidebands will then represent the nuclear 
uncertainty. 

III. RESULTS 

In this section the results of the CJ12min, CJ12mid, and CJ12max sets of global PDF 
fits are presented and compared with other modern PDF fits. Tables for the central PDF 
sets, the corresponding error PDF sets, and a user interface to read them can be found on 



the CJ 10] and CTEQ |79f web sites. 

The data sets used in the global fits are shown in Table [I] along with the x 2 values of 
the three fits for each of the data sets used in the analysis, as well as the total \ 2 values 
and the totals with the normalization contribution to the y 2 included. As is apparent 
from these values, the description of most of the data sets is independent of the size of 
the nuclear corrections. Indeed, the deuterium data are sensitive to the convolution of the 
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nuclear model with the d PDF; as the model is changed, the d PDF is altered by the fitting 
program to compensate. For observables such as the jet cross section, the modified d PDF 
can be compensated for by small changes in the gluon PDF, so the overall description of 
the data remains unaltered. One notable exception to this pattern is for the W asymmetry 
as measured by the CDF collaboration 22]. This observable is sensitive to the d/u ratio, 
and changes in the d PDF are not easily compensated by the u PDF, which is relatively 
well constrained by proton DIS data. The x 2 f° r the W asymmetry data does show a 
significant increase as the magnitude of the nuclear corrections increases beyond its middle 
value, indicating a preference for mild to medium nuclear corrections. 

In this regard it is interesting to compare our results to those of the recent analysis 



in Ref. 80], which included nuclear corrections for deuterium targets in DIS using a 4- 
parameter, Q 2 -independent phenomenological function with the parameters varied in the 
fit. The resulting correction factor, shown in Fig. 11 of Ref. 80], can be compared to those 
in Fig. |2] above. Their fitted form lies between the curves for the CJ12min and CJ12mid 
fits, as might be expected since these two fits have nearly identical values for x 2 , while the 
CJ12max value is higher. As noted above, much of the increase in \ 2 for the CJ12max set is 
due to the CDF W asymmetry data, which is also included in the fit of Ref. [8 0| . Although 
this comparison is not exact, since our nuclear corrections are Q 2 dependent [81] and those 



in Ref. 



80] are not, it is consistent with our observation that the nuclear model choices 



made for the CJ12min and CJ12mid sets are preferred by the data. 

The CJ12mid PDFs are shown in Fig. |3] at Q 2 = 100 GeV 2 with the PDF error bands 
calculated as described in Sec. Ill El on both logarithmic and linear x scales. The latter more 
graphically illustrates the behavior of the PDFs at large values of x, where the uncertainties 
from nuclear and finite-Q 2 corrections are greatest. The error bands are shown in more 
detail in Fig. HJ and compared to the CJ12min and CJ12max sets. It is clear that the 
effects of nuclear corrections are strongest on the d PDF, with the others showing little or 
no influence. In particular, the d quark PDF increases at large values of x as the magnitude 
of the nuclear corrections increases. This, again, is a reflection of the fact that it is the 
convolution of the nuclear corrections and the d PDF that is constrained by the deuterium 
DIS data, and not the d PDF alone. 

A comparison of the CJ12mid PDFs with several other sets is shown in Fig. |5l which for 
illustration includes the CT10 [43], MSTW08 42] and ABKM09 40] PDFs. The latter in 
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FIG. 3: Uncertainty bands for the u, d, d + u, d — u, s and g PDFs for the CJ12mid fit at 
Q 2 = 100 GeV 2 , shown on logarithmic (left) and linear (right) scales in x. Note that in the left 
panel the gluon is scaled by 1/10. 

particular are closest in spirit to the CJ12 PDFs in that similar kinematic cuts are imposed, 
and nuclear and finite-Q 2 corrections included, albeit using somewhat different prescriptions. 
One feature that is readily apparent in the first two panels is that the error bands for the u 
and d PDFs are reduced relative to those from the other sets. This is a direct consequence 
of the lower cuts on Q 2 and W 2 which allow DIS data to be fitted to larger values of x, 
approaching 0.9. Of course, when one considers the variations allowed by the different 
nuclear models, the error bands again increase, modestly for the u quark and substantially 
for the d quark. 

To further examine the role of the deuteron DIS data, a global fit was performed with 
the deuterium DIS points removed from the data set. The results are shown in Fig. [6] for 
the u and d PDFs relative to those from the CJ12mid set. Without the deuterium data the 
error band for the u PDF shows a modest increase beyond x ~ 0.7, while that for the d 
PDF shows a significant increase over most of the x range. It is important to note that the 
error band on the d PDF without the deuterium data exceeds the band which includes the 
nuclear uncertainties. This clearly demonstrates the usefulness of the deuterium DIS data, 
even in the presence of the nuclear uncertainties that its use introduces. 

For the strange quark PDF, the error band shown is somewhat smaller than for other 
sets. This is an artificial reduction, however, since we do not use the NuTeV dimuon DIS 
data [30(|, which are taken on heavy nuclear targets. Accordingly, the value of k in Eq. ([3]) 
is fixed, rather than fitted, and the error on the s + s combination is therefore proportional 
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FIG. 4: PDF uncertainties for the CJ12min (dot-dashed), CJ12mid (solid), and CJ12max (dashed) 
PDF sets relative to the reference CJ12mid set at Q 2 = 100 GeV 2 . 
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dashed), relative to the reference CJ12mid set at Q 2 = 100 GeV 2 . 



to that of u + d. In practice, we use the value k = 0.4 from earlier CTEQ fits |77j, |82j that 
employed the same choice for Q as in this analysis. Methods of constraining the s PDF 
without the use of heavy nuclear targets will be the subject of future investigation. 

The ratios of the d to u PDFs for the three CJ12 sets are shown in Fig. [71 which also 
compares the CJ12mid results with those from the other PDF sets in Fig. [5j Conventional 
parametrizations that treat the d and u PDFs with the same (1 — x) a2 form for the x — > 1 
behavior, have the characteristic that the d/u ratio tends either to zero or infinity as x 
approaches one. Conversely, the use of the parametrization given in Eq. fl2]) allows d/u to 
have any finite limiting value 1. This is clear from the results in Fig. [7(b), where the 

MSTW08 and ABKM09 d/u ratios tend to zero, while that for CT10 tends to infinity. This 
behavior distorts the error bands so that the MSTW08 ratio appears to have an anomalously 
small uncertainty, while the CT10 result is much larger. However, as the relative errors in 
Fig. [7(c) show, the MSTW08 and CT10 bands are similar, as would be expected, while the 
CJ12 and ABKM09 bands are somewhat reduced in size at larger values of x due to the 
additional high-x data used in those analyses. The ABKM09 band then diverges again from 
the CJ12 band because of the use of a conventional d quark parametrization. 

As the magnitude of the nuclear corrections increases, the value of the d/u intercept 
at x = 1 rises because of the mechanism discussed in Sec. Ill Dl The central values of d/u 
extrapolated to x — 1 are 0.012, 0.22, and 0.33 for the CJ12min, CJ12mid, and CJ12max 
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CJ12max (dashed) PDFs, (b) the CJ12mid (solid), CT10 (dot-dashed), MSTW08 (dashed) and 
ABKM09 (dot-dot-dashed) PDFs, and (c) relative to the central values of d/u for each PDF set. 



fits, respectively. Including the PDF errors we find 



d/u y 0.22 ± 0.20 (PDF) ± 0.10 (nucl), 



X— >1 



(8) 



where the first error is from the PDF fits and the second is from the nuclear correction mod- 
els. These values encompass the range of available theoretical predictions jl-5|. However, it 
is also clear that a relatively modest improvement in statistical precision and reduction of 
nuclear uncertainty would allow one to restrict the range of allowable physical mechanisms. 
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IV. CONCLUSION 



The CJ12 next-to-leading order PDFs presented in this study demonstrate the clear 
necessity of including nuclear corrections in global fits for PDFs when deuterium DIS data 
are included. The largest effect is on the d quark PDF, where variations can be observed 
for values of x > 0.5. The theoretical uncertainty accompanying these corrections is of the 
same order of magnitude as the PDF uncertainty, and the CJ12 PDF sets will enable one to 
assess the impact of nuclear effects on the error bands estimated for various observables. The 
impact of these corrections is important for observables that depend on PDFs at moderate 
to large values of x, such as the production of high mass states and cross sections at large 
values of rapidity. Examples of such effects for selected observables at the Tevatron and 
LHC were recently discussed in Ref. 

The CJ12 fits utilize an expanded data set obtained by relaxing the kinematic cuts to 
Q 2 > Q\ = (1.3 GeV) 2 and W 2 > 3 GeV 2 , and considering finite-Q 2 corrections such as 
target mass and higher twist effects in the theoretical calculations. The resulting significant 
improvement in the statistics of the DIS data at large x more than compensates for the 
nuclear correction uncertainty, resulting in a net improvement in the precision to which the 
d quark is extracted from experimental data. 

Further refinements of the CJ12 PDFs are anticipated as additional data become avail- 
able. One natural extension is to assess the constraints provided by the addition of new 
LHC data to the global fits, such as on the s quark distribution 3l|]. Additional constraints 
on other PDFs may be provided through Z rapidity measurements 83j, and high-p^ jet and 



direct photon production 



84|]. A dedicated 



function measurements at Jefferson Lab 185 



study of the impact of longitudinal structure 
on large-x gluons is also foreseen. The 



potential of large rapidity measurements in proton-proton collisions at the upgraded RHIC 
detectors, or the proposed AFTER@LHC experiment 87| can also be explored. 

As new and more precise measurements of observables sensitive to the c?-quark in the 



proton and less sensitive to nuclear corrections become available in the future [88H90(, com- 
plementing the existing W charge asymmetry data 22j, global QCD fits will become capable 
of constraining theoretical models of nuclear corrections in the deuteron. Not only will this 
reduce the nuclear uncertainty on the fitted PDFs, it will also provide a new avenue for 
progress in the theoretical understanding of high-energy processes involving nuclei. 
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Appendix A: Parameter values 



In this appendix we list the initial parameter values and their errors for the three sets of 
PDFs discussed in the text. 

TABLE II: Parameter values for the three CJ12 PDF sets at the initial scale Q = 1.3 GeV. The 
parameters without errors have been fixed by sum rules or other constraints. 



Parameter 


CJ12min 


CJ12mid 


CJ12max 


s-. U V 

a 


2.8468 


2.7684 


2.9343 


a l 


0.65468 ± 0.021379 


0.64839 ± 0.021146 


0.66220 ± 0.021519 


a 2 


3.5453 ± 0.017768 


3.5686 ± 0.016400 


3.5733 ± 0.017535 


a 3 


0.0 


0.0 


0.0 


a 4 


2.7201 ± 0.43223 


2.8970 ± 0.40045 


2.6616 ± 0.42657 


<2„ 
°0 


15.389 


21.515 


24.724 


d v 
V 


1.0296 ± 0.044922 


1.1004 ± 0.042154 


1.1328 ± 0.041967 


d„ 

a 2 


6.3767 ± 0.18600 


6.9525 ± 0.19054 


7.1420 ± 0.19474 


dv 

a 3 


-3.7930 ±0.10975 


-3.9659 ± 0.098409 


-3.9989 ± 0.094928 


d v 


5.8404 ± 0.28079 


5.9770 ± 0.24538 


5.9310 ± 0.23263 


b 


(0.078691 ±0.16487) x 10" 2 


(1.0379 ±0.16884) x 10~ 2 


(1.3464 ±0.21869) x 10~ 2 


c 


2.0 


2.0 


2.0 


u + d 
a 


0.12145 ± 0.0049067 


0.12154 ± 0.0048619 


0.12366 ± 0.0049009 




-0.23390 ± 0.0044525 


-0.23389 ± 0.0044146 


-0.23222 ± 0.0043787 


4 +d 


9.6939 ± 0.19012 


9.7127 ± 0.19172 


9.5842 ± 0.19752 


a 3 +d 


0.0 


0.0 


0.0 


4 +d 


23.689 ± 1.7400 


23.228 ± 1.7038 


21.882 ± 1.6494 


d—u 
a 


90.162 ± 21.177 


112.02 ± 25.643 


115.65 ± 26.686 




2.5411 ± 0.14278 


2.7048 ± 0.14772 


2.7408 ± 0.14923 


d — u 

a 2 


a| +d " + 2.5 


al +d + 2.5 


al +i + 2.5 


d — u 

a 3 


0.0 


0.0 


0.0 


d — u 
a 4 


-4.1657 ±0.19316 


-4.2848 ± 0.19532 


-4.1870 ±0.19143 


a 


46.911 


49.404 


51.157 


a{ 


0.61077 ± 0.032280 


0.62122 ± 0.033597 


0.63027 ± 0.034793 


a 2 


6.1691 ± 0.61981 


6.5242 ± 0.67025 


6.8007 ± 0.72727 




-3.5055 ±0.12506 


-3.5348 ± 0.13367 


-3.5298 ±0.14233 


< 


3.5292 ± 0.27765 


3.6436 ± 0.30591 


3.6777 ± 0.33240 


K 


0.4 


0.4 


0.4 
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